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I. Introduction: 


To gain information about the distribution, from statistics of 
a sample from that distri'b'Ution, remains one of the important goals of 
investigation in Mathematical Statistics. 

The investigation initiated here is concerned with drawing con- 
clusions about the characteristics of the unknown part of a certain 
collection, from knowledge of some characteristics of a suitable selected 
part of the collection of elements. 

We note here in general discussion classical results obtained in 
the area in which we are concerned without giving precise and due credit 
to those great scholars whose products we use. This is done mainly because 
the contributions to a basic idea are numerous and it is difficult to either 
list all of them for a specific idea or to list the principal person re- 
sponsible. 

We are interested in different characteristics of the elements 
of the population under investigation. 

When v;e say that the population has the distribution F(x), vie 
mean that we are investigating a characteristic X of elements of this 
population and that this characteristic X is a random variable v;ith the 
distribution function F(x). 

Let the characteristic X of elements of a population be a random 
variable with the distribution function F(x). This suggests a need for the 
definitions that are essential in what follov7s. We use the classical con- 
cept of Fisz. 

In general, let e denote an elementary/ event of- a set E of ele- 
mentary events. On set E we define a single-valued real function X(e) 



2 


such that the probability that this function will assume certain values 
is defined. 

For a precise formulation of the conditions which are to be 
satisfied by X(e) we introduce the notion of inverse image. 

Let X(e) be a single-valued real function defined on the set E 
of elementary events. The set A of all elementary events to vrhich the 
function X(e) assigns values in a given set S of real numbers is called 
the inverse image of the set S, 

It is clear that the inverse image of the set R of all real 
numbers is the whole set E. 

A single-valued real function X(e) defined on the set E of ele- 
mentary events is called a random variable if the inverse image of every 
interval I on the real axis of the form (- oo, x) is a random event. 

We shall set the probability that the random variable 

X(e) takes on a value in the interval I equal to the probability P(A) of 
the inverse image A of I, 

(The notion of a random variable corresponds in the theory of 
real functions to the notion of a function measurable vjith respect to the 
field of sets being considered.) 

It follows that if a random event A is the inverse image of a point 

X, the probability that the random variable X takes on the value x equals 

Cx ^ 

the probability of the event A, P (X = x) = P(A). 

The function P^^\s) giving the probability that a random variable X 
takes on a value belonging to S, where S is an arbitrary Borel set on the 
real axis, is called the probability function of X. P^^^(S) = £ S). 

The function F(x) - P(X ^ x) is entitled the distribution func- 
tion of the random variable X, It r ow follows that the probability distri- 
bution of a random variable X is determined by its distribution function. 

(VJe note that a randoro variable is a function and not a variable 
in the sense wiiich is usually understoed in mathematical analysis.^ 
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II« Characteristic function: (Ch. fu. ) 

• For X a random variable and F(x) its distribution function, the 

function .+.« 

f(t) = E (e^^^), 

where t is a real number and i is the imaginary unit, is called the charac- 
teristic function of the random variable X, 

/)^ 
itX 

e dF(x). 


We list without discussion some classical theorems about ch, fu. 


(a) If the moment of a random variable exists, it is expressed 


by 


L. s f 


(J) 


( 0 ) 


Where f'^(O) is the derivative of the ch, fu. of this random variable 
at t s 0, 

(b) If fCt, u), (t) and f 2 (M) denote the ch. fu, of the ran- 

dom variables (X,Y), X and Y, respectively, then the random variables X 
and Y are independent iff the equation 

f(t,u) = fi<t) f CiO 
2 

holds for all real t and^, 

(c) Convergence of ch. fu. implies convergence of the correspond- 
ing distribution functions, i.e,, distribution functions •'depend continu- 
ously’' on their ch, fu, 

(d) Tvjo distribution functions Fj^Cx) and F^Cx) are identical if, 

and only if, their ch. fu, f, (t) and f (t) are identical. 

^ 2 

(e) If a ch. fu, f(t) is absolutely integrable over (-oo, oo) 
then the corresponding distribution function F(x) is absolutely continuous 
and 


f1(x) = 

27T 


,.cc 

^-itX 




fe?- -:'T 





(f) Let f(t) be an arbitrary ch. fu. For every real x the limit 


P = 

X 


lim 1 
2T 


e-itXf(t)dt 


exists 


and is equal to the saltus of the distribution function of f(t) @ the point x. 


(g) Let jf (t)f be a sequence of characteristic functions and 

L n ) 

suppose that this sequence converges for all values of t to a limit function 
f(t). Assume that f(t) is continuous @ t = 0; then f(t) is also a ch.fu.- 

(h) A dorap lex- valued function of a real variable t is a ch, fu. 
if, and only if, (i) f(t) is non-negative definite and (ii) f(0) = 1, 

(i) Let f(t) be a real-valued and continuous function which is 
defined for all real t and v;hich satisfies the following conditions: 

(i) f(o) = 1 

(ii) f(-t) = f(t) 

(iii) f(t) convex f^or t 0, i.e. f(^^ ^ ^ 

2 2 


for all tj^7>0,t2 > 0. 

(iv) lim f(t) =0. 
t oo 

Then f(t) is the ch. fu, of an absolutely continuous distribution F(x). 


Ill, The Quadratic Regression Problem 

The concern is with two random variables X and Y vdth the condi- 
.tional expectation of Y given X is given by 

(l) E(Y^) = Bq + X 4-”' + holding almost everywhere. 

We say that Y has polynomial regression of order K on S. 

V?e make assumptions that the first moment of Y and the moment of order K 
of X exist . 

It follows from the d^^finition of expected values and (1) that 
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t 


E(Y) = E(X) + + Bj^E(x^) 

i.e. E(E (y/x)) = E(Y). 

For the proper values of BJ, and K, we use the terms quadratic 
regression and linear regression, " 

We use the term constant regression of Y on X if ECY^X) = E(Y). a,e. 
Now, for an n-sample of independently and identically distributed 
random variable from population with distribution function F(x) we 


symbolize the statistics 


= 2x, , = n X 


S = S(Xj^ Xjj) some other statistic* 

We are concerned with the task of finding a statistic which has con- 
stant regression on A. And conversely, this property will .-sometime determine 


the population. 
Consider 




^ A. X. X. + ^ b. X 


and some population which has the property that Q has quadratic regression 


On A. 


We must necessarily distinguish cases defined in terms of relations 

between coefficients A- and b. of Q and the regression coefficient B. . 

1 1 1 

The Demonstration is that the population is characterized by the 
property indicated. 


We need 

Lemma! A If 

i) X, Y are random variables 
ii) E(Y), E(X^) exist for K 0 (K is an integer), 

then Y has polynomial regression of order K on X if, and only if, 

(3) E(Ye^^^) = 5 B E(X®ei'*^^) 


hold for all real t. 


J 


i 


I 
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Proof: of 

Necessity: 

Assume Y has polynomial regression of order K on X, i.e, 

E(y/x) = + B^x t — Bj^y’^. a.e. 

multiply (1) by e^^^ 

eitXE(yyx) = |^Bq t B^ X + Bj^x’^J 

Take Expectation of both sides: 

E E(y/x)j = E/ei“ B3X^J j 


ECe^t*) E 


[e(t/x)7 = 

E(e^^^) E (Y) = ^ \ Erx®e^‘‘^^| 

^ 5 rO ^ V 

E(Ye^^-) = 2 

S^c ^ ^ 


h K 


J 


To prove Sufficiency: Assume that (3) is valid for all t. 


Then: 


^^itx 
e 


' [y B^X-jj = 0, 


This may be written as: 

oo ^ 

itx 


(4) y e-’ E 


^ '■i 

' 5s© 


(x) = 0 


where F (x) is the marginal distribution of X. 
1 


Let Pjj(A) be the probability function of the random variable X. This is a 

set function defined on all borel sets of R^, Then we can write for equa- 
tion (4): 


itx _ 
e E 




(y -2 b/ I 

5 "S o / 


x) d P = 
X 


0 



i 


1 


1 


J 


i 


Let (A) = J<’- 2 B X® y x) dP , This is a function of 

bounded variation v;hich is defined on all Borel sets A of We see that 


(■ 


eitxd ^ = 0 


Since the uniqueness theorem for ch. functions is valid for the Fourier 
•transforms of functions of bounded variation we conclude that 


(A) = = 0 for all Borel sets A, 

This is only possible if 



IV, The Differential Equation for the Ch, Fu, 

To return to the tone of the introduction, here we add these comments: 
Cheir act erizat ions of both discrete and continuous distributions 
are obtainable in a variety of vjays. As an example of this for a discrete 
distribution, it is knovm that the relationship of equality of mean and 
variance characterizes the Poisson distribution. 

It is possible also to characterize distributions by reason of one 
linear statistics. And the independence of tvfO statistics can be used to 
characterize distributions. 

Also, various populations have been characterized by using the property 
that one suitably chosen statistic has constant or quadratic regi'ession on 
another. Much work in this field has been done by Lukacs and by Laha. 

We are here concerned with a case of quadratic on linear. 

Let Xj^ Xg — — x^ be a sample of size n from a population with distri- 
bution function F(x), Assume that the second moment of F(x) exists. 

. ■ 

f(t) = / e^^^‘dF<x) ■ 

is the ch. fu. of F(x), 
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Consider the quadx*atic statistic: 

Q = ^^OijX.X. + ^ b X . 

4^*1 ^ J«| ^ 1C 

and suppose that Q has quadratic regression on A = V X-i, 

i.e. (5) E(Q^A) = Bq + B^A f B 2 A^ 

it A 

If we multiply both sides of (5) by e and recall this step in the proof 
of the Lemma, we have 

(6) ECQe^"^^) = + B^ECAe^"*^^) + BjECA^e^^^) 

To simplify computation let 
ai * 

•fe 

b = . 4L. b. ; thus changing the symbols for coefficients 

in the value of Q. We also note for convenience; 

f(t) = E(e^''^’') 

f*(t) = i ECxe^^’*) 
f"(t) s - ECX^e^^^) 
f‘”(t) = - i E(X^e^^^) 

ECXe^^’*) = i f’(t) 


E(xV’'^^) = -f»(t) 

= i f”'(t) 


From a well-known property of ch. fu, , in some neighborhood of the origin 
f ( t ) is different from sero so that wc can i-frite 

/(t) = In fCt) 
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We then have 


f”(t) = /"(t) + 

f(t) fit) ‘■ 


The left-hand side of (6) is: 

k 


E(Qe^’^*) = E r a. X.X b.X ) = 

E feA.X? +|ii' atj X X *2 b.X ] 6^**1 . 

<V, ^ 3 J.y :) J ^ 

^ 2 itx. 

Z a^ECXje .3,ECa _2, 

(-•f 


Set the result = + Eg. Then ^E 


// 




fj-i 




3_ z_i '7 

L^i ■J--! 


- 2 2 
r 


>*»- 


s.- 


-- kC-i^‘(*). 




n-‘ 


Thus: the left-hand side of (6) is Ej^ ''■ ^5 ^ ^3 ” 


ffCtjf" Ui 

f"(t)»t + 3^ lfi{t)]2 + tb f^(t)l 

L 

f(t) [f(t)]2 f(t) J 


and this expression, by ( 7 ) is 


(8) - |^f(t)J’^ ^ /"(t: (a^ + a2> |j2^^(t)J ^ + 1^ b ^^(t) 




The right»hand side of (6): 


Bq- + B2E(A^ 

K 

Recalling that A = Xi, 

Bo E(e^'*^^^ = 

^ E(Ae^^^t) = E 

0 

s - i B^ n f^(t) ^ 

= ^2 ^ {2 ® ^ 


e V e 




= B> 


®2 E X.Xje*'*^ 

1^- /f”(t) [f<t)J ^ 1 + n(n - r):[T;i F^(t^^ j^f(t)jCP 


= - [f(t)| B { i) f"(t ) - [fHtf + n^ f^(t) 1^1 
I 1- f(t) [f(t)]^ •• -f(t) J J 


= - B2 ^”(t) + n^ [/^(t) 



So the right-hand side is: 

- |n B^ /•’ (t) + n 2 B 2 J/Ht)J ^ + n t /^(t) - BoJ 

The equation (6) is now: 

®1 ^ +tb j^^(t) = 

n B^ /"(t) + n^ [/^(t)|,^ + In B^^ ^'^(t) - B^ 

We can solve for Bq 
B o s (n B^ - a^) y^(t) + (n^B^ - " ®2^ ^ ■*■ ^ (nB^ - b) 


(t) 
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And if we substitute convenient symbols for the coefficients— we have; 

(9) <)^/"(t) +1(^3 /^(t) = 

Introduce the function; 

s(t) = -r/Ht) 

Equation (9) above becomes; 

S^(t) - ^ s(t) = Bq or 

(10) i a. ^ S + Bo 

c<t a i 

So our function S(t) is such that 

S(0) ='^/'-(0) and 

(11) ^ = 1 /”(0) = i, /T ^ 

dt V 

t=0 


Here, and ^"^are the mean and variance respectively of the distribution 

F(x). 

Now, from equation (10), 

We get, by substituting the values above gotten by putting t = 0, ari^ equation 
(1) a relation between the coefficients of Q and the regression coefficient; Ij-.^ 

(12) - ^ ^ + Bo 

We see, from (12) that if c). = 0, while at least one of^ andc^ is dif- 

ferent from 0 then S(t) and j/^(t) are constants so that vre obtain a degenerate 

distribution. V/e, thus, may assume without loss of generality that ?! 0. 

1 

Let 2 = 0,4^ ^ 0, 4 = 0 . 


These conditions will support this theorem; 

Let X 3 sample of size n taken from a population v?hich huS a 

%■ 

finite variance ^ , Now consider the quadratic statistics : 



I 


1 


I 


a = a^j^Cn - 1) - a2 ^ 0 


^1 = i 

4* • / 


U 


*' 


Let and B2 be two real constants: 


l'£ *>3 

s>»-/ 


B, = 


^2 = -| (a^ t a^), 

n^ 


The relation: 


E(q/a) = Bp + Bj^A + B2A" 


holds a.e, iff the follovjing conditions hold; 


(1) B, 


= 4-^ f . 


(n - 1) - a. 




(2) The population is normal. 


The relations; (i) a (n - 1) - a„ ^ 0 

1 2 

(ii) B^ = 4 i- 


n 


(ill) B2 - (a, + ^2^ 

are equivaleiir to the conditions: 


(13) ; A ds 


^ S +, Bp reduces to 


n‘+). V (is _ 

% Ht " 


= . 


ds 

dt 


Put t = 0 and use 


. ^2 


•/" (0) = i<r 


t = 0 


and we have: 


t>0 = - A. I<r'' 






with 



4^ 



and since 


- n B 2 “ 


Bq * - I] B 2 ) (T and since B^ = __1 


a-, + a. 


Bo = (a^ - ^2 jja^ (n - 1) - a^j 


And this is condition (1) above. 


We integrate; (14) 


iA - 3 

for t = 0, = i(f ^ 

dt 


So Bo = - 




• ds _ ^2 

1 gij- - - <r 


i S(t) - - <f t + k 


/(t) = V + xJi 

/(t) = - I y^t^ + i/^t 


/(t) = i ^r^t^ + i/t't. 

2 

and this is the ch. fu. of a normal distribution. VJe next prove sufficiency 
of the assumption. 

We continue this technique of assigning different relationships be- 
tween the coefficients of Q and the regression coefficients; we get 
distribution^ 
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Research Activity 
Student Assistants 


Two upper level mathematics students in the College were 
research assistants on the project. The students are; 

Ms. Mary Murray 

Mr. Kendrick Henderson 

The students v;ere assigned to ascertain definitions and 
properties of the following transforms; 

Fourier Cosine 
Fourier Sine 
Laplace 

Inverse Laplace 
Mellins 

Inverse Mellins 
Stielties 

I 

Hilbert 


In my opinion, the investigation should be continued. The 
subject is quite broad and the many items of reference articles 
gathered to support the inquiry have not been utilized so far to 
any appreciable degree. 

We view the next step in a bnoad investigation to be that 
of extending Quadratic regression to cubic to quartic, etc. 

The number of relations, as utilized here, v;ill necessarily 
increase. 

The size of the sample, as demonstrated here, can be in- 
creased with the computational format make use of a computer 


program 
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